DocMining: A Cooperative Platform for Heterogeneous Document Interpretation According to User-Defined Scenarios

نویسندگان

  • Eric Clavier
  • Gérald Masini
  • Mathieu Delalandre
  • Maurizio Rigamonti
  • Karl Tombre
  • Joël Gardes
چکیده

This paper describes the DocMining platform, that is aimed at providing a general framework for document interpretation. The platform integrates processings that come from different sources and that communicate through the document. A task to be performed is represented by a scenario that describes the processings to be run, and each processing is associated with a contract that describes the parameters, data and results of the processing as well as the way it has to be run. A controller interprets the scenario and triggers each required processing at its turn. The architecture of the platform is flexible enough to allow users to create their own objects, integrate their own processings into the platform, design their own interfaces and define their own scenarios. All data (documents, scenarios, contracts, etc.) are represented in XML, to facilitate data manipulation and communication inside the platform. After a brief state of the art in section 1, section 2 presents an overview of the DocMining platform and section 3 its implementation. The next sections deal with an example of a complete scenario, for the extraction of graphical and textual parts from a gray level image: Section 4 explains how the scenario is run and shows the results it provides, section 5 describes its construction.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

RRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features

Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...

متن کامل

Cooperative Control of Mobile Robots in Creating a Runway Platform for Quadrotor Landing

Multi-agent systems are systems in which several agents accomplish a mission in a cooperative manner. In this paper, a novel idea for the construction of a movable runway platform based on multi-agent systems is presented. It is assumed that an aerial agent (quadrotor) decides to make an emergency landing due to reasons such as a decrease in energy level or technical failure, while there is no ...

متن کامل

The Proteus Approach of Maintenance Work Flow Management

Equipment maintenance is an essential need for successful industrial enterprises. Several software applications are used today for initiation and management of maintenance activities. This paper introduces the PROTEUS integration platform for cooperative interactions between maintenance applications. It is necessary to define the order of these interactions, the so-called business logic workflo...

متن کامل

Cover Page for Paper: A WEB SERVICE-BASED PLATFORM FOR CSCW OVER HETEROGENEOUS END- USER APPLICATIONS Authors

In this paper, a flexible and extensible platform for computer supported cooperative work based on web service technology is presented. This platform, called HERMES, enables collaboration among users of heterogeneous applications over the web. Web services provide an open, standard communication infrastructure that eliminates dependencies on proprietary technologies and platforms. Heterogeneous...

متن کامل

Web pages ranking algorithm based on reinforcement learning and user feedback

The main challenge of a search engine is ranking web documents to provide the best response to a user`s query. Despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003